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URLs, protocols, scaleability, etc, - and WORK 

From: Sasha Chislenko (sashal@netcom.com) 
Date: Mon Oct 04 1999 - 12:14:10 PDT 

^ Next message: Sasha Chislenko: "Re: Help requested for Thesis'* 

. Previous message: Robin Mastenbroek: "Help requested for Thesis" 

^ In reply to: Robin Mastenbroek: "Re: filtering URL" 

^ Next in thread: Joaquin Delgado: "Citeseer (Error in URL)" 
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Looks like we are getting serious here... 

Some comments on previous messages: 

I think we are mixing two issues with URLs. 
Persistent identifiers for objects are one thing, and 
delivery protocols are and entirely different subject. 
For many types of objects persistent identifiers 
already exist. Such as ISBNs for printed publications, 
or Social Security Numbers (or their equivalents in 
other countries) for people. 

On the Web, there have been a lot of discussion about URNs. 
Internet Archive ( <archive.org> ) keeps time-sliced 
copies of all Web materials. 

The Distributed Object services looks like a very serious 
work in this direction - and its promoters probably are 
complaining that it isn't taking off because nobody is 
writing compelling applications - even for the areas 
where perfect object identification already exists. 

All this is much easier than, and independent from, good 
scaleable ACF algorithms. 

BTW, there _are_ good scaleable solutions; Firefly handled 
requests in real-time with 200,000 profiles, and I was 
working on doing more, with advisor filtering, usage of 
specialty/controversial items only, and clustering). 

Cross-domain relevance can be computed for topical areas 
or opinion clusters; I plaimed to do it at Firefly, the 
feature-guided rec. server was developed with largely this 
in mind - but again, we never got to it. 

Many applications can empower each other without competition. 
E.g., people with similar tastes in music could be recommended 
to each other on car pool matching service; both music site 
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and car pool site would benefit from it. Same with books, 
dating, etc. 

I would definitely want to know the opinion of my peers 
on CNN in general, not only on particular stories. 

Search engines should definitely work with ACF. 
If I look for restaurants, I want recommendations 
in my city, price range, food type, etc. 
Eventually, with XML encoding, one could ask for 
"an expensive Mexican restaurant in Boston*', but 
quality and style could be still best provided by 
my taste peers. 

There are much better ways to match profiles than 
straight correlation (I'll share some suggestions if 
we ever get practical) - and lots of machine leaming 
algorithms to optimize parameters for any required 
performance. 

On Explicit ratings: Not all users are lazy and cheap; 
many would be happy to give careful feedback, and even 
pay for resources. My first 1993 proposal 
<http://www.lucifer.com/--sasha/articles/MessageRating.html> 
had payments as ratings. They also allow better understanding 
of user's interests. 

A combination of implicit and explicit ratings may work best; 
explicit ratings may explain what various types of user behavior 
mean, and also provide criteria for optimization of algorithms. 

I do not see a single problem that cannot be resolved with a 
small to moderate amount of effort. 

Maybe, the best approach would be to look at the issue from an 
optimistic and constructive point of view, rename all the 
problems that keep being reshuffled here and elsewhere for years, 
into Action Items, structure them for a chosen set of ACF 
applications - and we'll have a blueprint for a good set of 
services. And then get it funded. By the government, if 
the democratic governance is able to recognize the value of 
historical restructuring of global knowledge exchanges (I 
don't think it will), or a private group that, unlike some 
others, will not kill the idea to make a quick buck. 

It shouldn't take much money either. 

I can get the Russian group working on Newsfilter to develop 
a working prototype service at a cost the subscribers to this 
Hst could afford personally... 

Any more problems there? :-) 
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Alexander Chislenko <http://www.lucifer.com/~sasha/home.html> 
Newsfilter page: <http://www.lucifer.com/-sasha/newsfilter_plan.html> 



Posted to the collab@sims.berkeley.edu mailing list. 
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